SILT: Efficient transformer training for inter-lingual inference

Authors

Abstract

Transformers perform precision tasks such as question answering, Natural Language Inference (NLI), and summarization well enough to rank among the best paradigms for Natural Language Processing (NLP) tasks. NLI is one of the best scenarios for testing these architectures, because it requires understanding complex sentences and establishing the relationship between a hypothesis and a premise. Nevertheless, these models generalize poorly to other domains and struggle in multilingual and inter-lingual scenarios. The leading approach in the literature to these issues is to design and train extremely large architectures, but this causes unpredictable behaviors and raises barriers that impede broad access and fine-tuning. In this paper, we propose a new architecture called Siamese Inter-Lingual Transformer (SILT). This architecture efficiently aligns multilingual embeddings for Natural Language Inference, allowing unmatched language pairs to be processed. SILT leverages siamese pre-trained multilingual transformers with frozen weights, where the two input sentences attend to each other and are later combined through a matrix alignment method. The experimental results in this paper show that SILT drastically reduces the number of trainable parameters while allowing inter-lingual NLI and achieving state-of-the-art performance on common benchmarks.
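The data flow described in the abstract (a shared frozen encoder, mutual attention between the two sentences, and a small trainable part on top) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the matrix alignment method, so scaled dot-product cross-attention is swapped in as an assumption. `frozen_encoder`, `pair_features`, the embedding width `DIM`, and the three-class head are all hypothetical names and choices made for illustration only.

```python
import math
import random

DIM = 8  # toy embedding width (assumption; real encoders are far wider)


def frozen_encoder(tokens):
    """Hypothetical stand-in for a frozen pre-trained multilingual encoder.

    Each token maps to a fixed pseudo-random vector; nothing here is trained.
    The same function serves both branches, which is what makes it siamese.
    """
    out = []
    for tok in tokens:
        rng = random.Random(tok)  # string seed: same token, same vector
        out.append([rng.uniform(-1.0, 1.0) for _ in range(DIM)])
    return out


def softmax(row):
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def alignment_matrix(a, b):
    """Scaled dot-product score between every vector in a and every vector in b."""
    scale = math.sqrt(DIM)
    return [[sum(x * y for x, y in zip(u, v)) / scale for v in b] for u in a]


def attend(a, b):
    """For each vector in a, return a softmax-weighted mixture of the vectors in b."""
    weights = [softmax(row) for row in alignment_matrix(a, b)]
    return [[sum(w * v[d] for w, v in zip(ws, b)) for d in range(DIM)]
            for ws in weights]


def mean_pool(vectors):
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(DIM)]


def pair_features(premise, hypothesis):
    """Encode both sentences with the shared encoder, cross-attend, pool, concatenate."""
    p = frozen_encoder(premise)
    h = frozen_encoder(hypothesis)
    return mean_pool(attend(p, h)) + mean_pool(attend(h, p))


# Unmatched language pair: English premise, Spanish hypothesis.
features = pair_features(["a", "man", "sleeps"], ["un", "hombre", "duerme"])

# Only a small linear head over the pooled features would be trained here
# (assumed 3 NLI classes: entailment, neutral, contradiction).
NUM_CLASSES = 3
trainable_params = len(features) * NUM_CLASSES + NUM_CLASSES
print(len(features), trainable_params)
```

In this sketch the encoder contributes no trainable parameters, so the trainable count depends only on the layers placed after it. That is the general mechanism by which freezing a large pre-trained model keeps training cheap; the actual parameter counts for SILT are reported in the paper, not here.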

Similar resources

An Inter-lingual Reference Approach For Multi-Lingual Ontology Matching

Ontologies are considered as the backbone of the Semantic Web. With the rising success of the Semantic Web, the number of participating communities from different countries is constantly increasing. The growing number of ontologies available in different natural languages leads to an interoperability problem. In this paper, we discuss several approaches for ontology matching; examine similariti...

Using inter-lingual triggers for machine translation

In this paper, we present the idea of cross-lingual triggers. We exploit this formalism to build a bilingual dictionary for machine translation. We describe the idea of cross-lingual triggers and how to make good use of them to produce a bilingual dictionary. We then compare it to ELRA and to a freely downloadable dictionary. Finally, our dictionary is evaluated ...

Cross-Lingual Type Inference

Entity typing is an essential task for constructing a knowledge base. However, many non-English knowledge bases fail to type their entities due to the absence of a reasonable local hierarchical taxonomy. Since constructing a widely accepted taxonomy is a hard problem, we propose to type these non-English entities with some widely accepted taxonomies in English, such as DBpedia, Yago and Freebas...

Restructuring the Inter-Lingual Index

This document reports on the restructuring of the Inter-Lingual Index to provide a more efficient mapping across the wordnets. By adding so-called Composite ILIs that group closely related senses, it is possible to connect synsets that cannot be matched otherwise.

Training Factor Graphs with Reinforcement Learning for Efficient MAP Inference

Large, relational factor graphs with structure defined by first-order logic or other languages give rise to notoriously difficult inference problems. Because unrolling the structure necessary to represent distributions over all hypotheses has exponential blow-up, solutions are often derived from MCMC. However, because of limitations in the design and parameterization of the jump function, these...

Journal

Journal title: Expert Systems With Applications

Year: 2022

ISSN: 1873-6793, 0957-4174

DOI: https://doi.org/10.1016/j.eswa.2022.116923